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U 1 Introduction 

^' 

d ', Given a function /i : Fp x Fp x ■ ■ ■ x Fp ^ C, we define the usual expectation 

operator 

Eni,...,n,(/i) := P~''T^m,...,n,e¥Mni, ...,nk). 
^ , We also define, for / : Fp ^ C, the operator 



A(/) := E„,,(/(n)/(n + rf)/(n + 2rf)). 

If / were an indicator function for some set S C Fp, this would give a normal- 
ized count of the number of three-term progressions in S. 
^ ■ In the present paper we establish a new structure theorem for functions 

/ : Fp ^ [0, 1] that minimize the number of three-term progressions, subject 
to a density constraint; and, as a consequence of this result, we prove a fur- 
^ I ther structural result, which can also be deduced from the work of Green [3J, 

■ though only for high densities (Green's result only works for densities exceed- 

ing 1/ log^(p), though perhaps his method can be generalized for this particular 
problem to handle lower densities). 

Before stating the theorem, it is worth mentioning that Green and Sisask 
[5J have shown that sets of high density (density close to 1) that minimize 
the number of three-term arithmetic progressions, are the complement of the 
union of two long arithmetic progressions (actually, their result is stated in 
terms of sets that maximize the number of three-term progressions, but there 
is a standard trick to relate this to the minimizing sets). 

Our main theorem is now given as follows: 

Theorem 1 Suppose that 

/ : Fp ^ [0,1] 
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minimizes A(/), subject to the constraint that 



A(/) > e e (0,1]. 

Then, 

• Let C{n) equal f{n) rounded to the nearest integer, which is therefore 
or 1. Then, 

Ej/H-C(n)| « p{\ogp)-"\ 

So, f must be approximately an indicator function. Furthermore, we get the 
same conclusion if f satisfies IE(/) > 0, and A(/) comes within 0{l/p) of the 
minimal value for this density constraint. 

• There exists a function r : Fp — [0, 1] such that E(r) = IE(/), where A(r) 
is very close to the minimal A(/), specifically 

A(r) = A(/) + O(p-i), 

such that if we let, for some L, 

S := {n G Fp : {r *r){2n) + 2{r * g){—n) < L}, where ^'(n) := r(— n/2), 
then 

T.nHn)-S{n)\ « p{\ogp)-^'\ (1) 

(Please see subsection M . 1\ for an explanation of this part of the theorem.) 

• We have that there exist sets A and B of Wp, with \A\ > p^~°^^^ and 
\B\ > p^^'^ , such that the set for which f is approximately an indicator function, 
is roughly the sumset A + B. More precisely: If we let C{n) denote f rounded 
to the nearest integer, as in the first bullet above, then 

Y.n\{A*B){n)-\B\C{n)\ « p\B\{\og\ogp)-^/\ 

Furthermore, we may take A = C and take B to be a certain "Bohr neighbor- 
hood" B, which is described in the proof of the theorem. 

1.1 A remark on the second part of the theorem 

By ([T]) we see that r is nearly an indicator function for the set S. Let us 
suppose, for the purposes of discussion, that it is exactly an indicator function 
for some set, and let R be this set. Note that R and S must have small 
symmetic difference. 

When does an n G Fp belong to the set S7 To decide this, given n, we let 
A^^i be the number of pairs {x,y) & Rx R such that n, x, y forms an arithmetic 
progression; we let N2 be the number of pairs {x,y) G Rx R such that x, n, y 



2 



is an arithmetic progression; and, we let be the number of such ordered 
pairs where x, y, n is an arithmetic progression. For n to belong to S, we must 
have that 

Ni + N2 + N3 < L. 

Since S and R have small symmetric difference, we see that our r can be 
thought of as enjoying a "local minimal" property: Not only does r minimize 
A(r) up to an error 0{l/p), subject to E(r) > 9, but we can easily decide 
whether n G Fp belongs to R or not, simply by checking to see how many 
progressions pass through the point n, with the other two end-points in R. 
If this count is small enough, then n likely belongs to R (though it certainly 
belongs to S); but, if the count is large, n hkely does not belong to R. 

The most difficult part of this proof that R and 5* are nearly the same, is 
handling those n where A^^i + A'"2 + exactly equals L. 

1.2 Remarks on the third part of the theorem 

One reason to believe the third bullet above is that from the second bullet we 
expect that / is an indicator function for a level set of a "smooth function" 
(r * r){2n) + 2(r * g){—n)] and, as is well known, such level sets must be 
approximately the union of a bunch of translates of a Bohr neighborhood of 
the function, at least when their density is large enough. 

It should be remarked that sumsets are quite special structures, as are 
smooth functions of the type {r*r){2n) + 2{r * g){—n), and only a vanishingly 
small proportion of the subsets of Fp are sumsets or form the support of a 
smooth function; so, the third bullet is saying something fairly non-trivial 
about our minimal /. 

Also, there are loads of other consequences that one can deduce from the 
third bullet. One of these is that, upon decomposing the Bohr neighborhood 
B into a union of arithmetic progressions, one can deduce that C is essentially 
the union of a "small number" of somewhat "long" arithmetic progressions 
("small number" can mean a power of p, say p'^, where c < 1), all having the 
same common difference. 

2 Proof of Theorem [T] 

The proof of this structure theorem depends on a certain function r^, which 
we presently define. 

Definition. Given a subset 5* of a group G, we let r^lS) denote the size of 
the largest subset of S free of solutions to x + y = 2z, x y. In all the uses 
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of r3 in the present paper, G = Z and S = [N] := {1,2, A^}, for varfous 
different values of A^. 

Bourgain [2] has recently shown that 

r3([iV]) « iV(logiV)-2/3, (2) 
and from a result of Behrend [1], we know that for sufficiently large, 

r3([iV]) > Nexp{-c^logN), 
for a certain constant c > 0. 

2.1 Proof of the first part of Theorem [1] 

For this part we will begin by assuming that IE(/) > Kp{logp)~^^^, for as large 
a K > as we might happen to need, since this part of the theorem is trivially 
true otherwise. 

Here we will first show that the minimal / is well-approximated by an 
indicator function; actually, we will prove even more - we will show that if A(/) 
comes within 0{p~^) of this smallest value, subject to the density constraint 
E(/) > 6, then / must be approximately an indicator function. To do this, we 
will require the following proposition, proved in subsection 12. 4[ 

Proposition 1 Suppose that A and B are disjoint subsets of ¥p, such that 
/ : Fp — > [0, 1] has the property 

for neA, f{n) < 1 - e, < e < 1/3, 

and suppose that 

support(/) = AUB. 

Then, for j3 > satisfying 

ef3 > p~^^'^logp, 
there exists a function g : ¥p [0, 1] such that 

and yet 

Aig) < A(/) + 2/3-£V'W^o/4 + 0(p-i), 

where 

Wo := ^a,a+d,a+2deAf{a)f{a + d)f{a + 2d). 
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We also will require the following quantitative version of Varnavides's the- 
orem 



Lemma 1 If S C Wp satisfies \S\ > 2{r-?j{N) /N)p, we will have for any 2 < 
N <p that 

Proof of the Lemma. The proof of this lemma is via some easy averaging: 
We let An denote the set of all arithmetic progressions A OFp having length 
A^. These arithmetic progressions are to be identified by ordered pairs {a,d), 
d 0, where a is the first term in the progression, and where d is the common 
difference. Note that this means we "double count" arithmetic progressions in 
that the progression a,a + d,a + 2d, a + kd is distinct from a + kd, a+ {k — 
l)d, a. 

It is easy to check that each sequence a,a + d,a + 2d, d ^ is contained 
in exactly N^/2 + 0{N) of these A G An'- We have that each three-term 
progression is contained in the same number of A G An, and each A E An 
contains /2+0{N) three-term progressions; hence, if P denotes the number 
of A G An containing a particular sequence a,a + d, a + 2d, we have since there 
are p{p — 1) non-trivial progressions in Fp, that 

p{p-l)P = \AN\{Ny2 + 0{N)), 

whence P = N^/2 + 0{N). 

So, if we let T^i^X) denote the number of sequences a,a + d,a + 2d E X , 
(i 7^ 0, we have that 

US) = {Ny2 + 0{N))-'j:AeAM^^S). (3) 

Next, we need a lower bound on how many A G An satisfy |A fl S"! > r^i^N): 
First, note that for each d G Fp, d ^ 0, there are exactly N arithmetic pro- 
gressions A E An having common difference d that contain a particular point 
a G Fp. So, 

Ea^aJAhSI = EsesE^^-.N = (p-l)iV|^|. 

Let Y be the number of A E An for which |y4 fl 5*1 > rs{N). Then, we have 
{\AN\-Y)r3{N) + YN > {p-l)N\S\, 



which implies 

P-1) 

N-r^iN) 



y , iv-l)N\S\-\AN\rm , ^,_,^\s\-\AN\irm/N). 
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For each of these Y progressions A G we will have that T^i^An S) > 1; 
and so, we deduce from that 

> (P- 1)1^1 -|^^|(r3(iV)/iV) 
^ - Ny2 + 0{N) 

Using the easy to see fact that |^Ar| = p{p — 1), we deduce that if 

\S\ > 2{r,{N)/N)p, 

then 

^ 2p\r,{N)/N) 
3^ > - N^ + 0{N) ■ 

The lemma easily follows on rephrasing this in terms of A(S'). ■ 
Now we let 

A := {neFp : f{n) E [s,1-e]}, 

where e > will be determined later. In order for / to be minimal, from 
Proposition [1] we deduce that we must have that if eP = p~^^^\ogp, then 

p > e^p~^Wo/8 + 0{l/p). 

So, since we trivially have that 

Wo > £VA(v4), 

it follows that 

A(^) < 8£-V^^^logp. (4) 

We would like to now apply Lemma [H to this, but in order to do so, we 
must solve for N such that 

\A\ > 2r-i{N)p/N. 

To this end, we require the bound ([2]) of Bourgain, which implies that if we 
let 

= exp(c(p/|A|)^/^) < p, since \A\ > Kp{logp)~'^^^ , 
then we will have that 

\A\ > j9(logA^)-2/3 > 2r3{N)p/N, 

as we require. 

From this it follows from Lemma [T] that 

A{A) > r3{N)/N'^ > > exp{-3c{p/\A\f/^). 
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It follows now from (jlj) that 



A\ < p\og ^/^(e 



'p), for e > p 




So, if we let C be the function / rounded to the nearest integer (which will 
be either or 1), then for n & A we will have \f{n) — C{n)\ < 1, while for all 
other n we will have \ f{n) — C{n)\ < e. It follows that 



Choosing e = (logp)"^/'^, we deduce that this sum is 0(p(logp)^^/^), just as 
in Bourgain's theorem ([2]). This completes the proof of the first part of our 
theorem. 

2.2 Proof of the second part of Theorem [1] 

Given a function h : ¥p [0, 1], we let 



In order to proceed further, we will require the following proposition. 

Proposition 2 Fix A C Wp, and associate to each a & A a real number 
Wa G [0, 1]. Among all functions h : ¥p [0, 1] satisfying 



E{h) = 7 > TjaeAWa, 

those which minimize A{h) have the property that there exists L > such that 



Enl/H-C(^)l < (£ + (log£^V)"'/')p, fore > p-^/^Hogp. 



h2{n) := h{-n/2) 



and then we define 



Fh{n) := {h*h){2n) + {h*h2){-n). 



for n e¥p\A, h{n) = 



1, if Fh{n) < L- 
0, if Fh{n) > L. 



The proof of this proposition can be found in subsection 12.51 
We will use this proposition to construct a sequence of sets 



Ai, A 



C F. 



and a sequence of functions 



ri, ra, : Fp 



[0,1], and all E(ri) 



such that the following all hold. 
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• First, I All =2 and Ai+i = AiU {xj+i, 

• second, A(ri) < A(/) + 5p"2|Aj|; 

• third, given particular fixed values for rj(n) on Aj, we have that mini- 
mizes A(rj), subject to the density constraint E(rj) = E(/); 

• and finally, for each n E Ai, ri{n) G [1/4,3/4]. 

Clearly, this process cannot continue past the [p/2jth iteration, as the sets 
Ai grow by two elements after each iteration. Furthermore, we will show that 
whenever the process does terminate (which it will in either case 1 or case 2 
below), we will be left with a function r : Fp [0, 1] satisfying the conclusion 
in the second bullet of Theorem [H 

For the time being, let us suppose that these sequences can be constructed 
as claimed: Suppose we have constructed A^; we will now show how to con- 
struct v4j+i. To this end, we apply Proposition [2] with A = Ai (in the case i = 
we let A be the empty set), and then we deduce that for some L > 0, rj(n) = 1 
(we use ro{n) := f{n)) for Fr^n) < L and rj(n) = for Fr-{n) > L. We fur- 
thermore apply the already-proved first part of Theorem [1] from subsection 
12.11 and deduce that since A(rj) < A(/) + 10ip~^, 

^n\ri{n) -C{n)\ < p{\ogp)-^/\ 

where C{n) is ri{n) rounded to the nearest integer. 

Now, if there are two distinct places x,y E ¥p \ Ai for which 

r,(x),r,(y) G [1/4,3/4], 

then we just let 

Ai+i := {x,y}, andrj+i := r^. 

So suppose that there are no such x and y; there are three possibilities to 
consider. 



2.2.1 Case 1: ri(n) > 1/2 for all ne¥p\Ai where Fr^n) = L. 

Note that we include in this case the possibility that there are no n E ¥p \ Ai 
such that Fr-{n) = L. 

If we are in this case, then it means that 

E„eFp\A, : Fr,{n)=L\n{n) - 1| < p(logp)"^/^; 

and so, rj(n) is very close to 1 at most places n E ¥p\Ai where FrX'^) = L. It 
follows that if we were to let S be the set of all n E ¥p \ Ai with Fj.-{n) < L, 
then 

EnesHn) - 1\ < p(logp)-2/l 
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In order to extend this sum to all n G Fp, we will need to show that \Ai\ 
cannot be too big. Basically, we will show that if it is, then A(/) could not be 
minimal. 

To see this last point, we apply Proposition [1], using A := Ai, h := r^, 
e = 1/4, and e/5 = p~^/^logp, and we deduce that there exists 

g : ¥p ^ [0,1], E{g) > E(r,,), 

and yet 

A(^) < A{f) + 8p-^/Hogp-p-'^Wo/64: + 0{p-^), 

where 

Wo := Ea,a+d,a+2dGAri{a)n{a + d)ri{a + 2d) > 4-VA(A). 
We wish to apply Lemma [D First, let 

N = exp{c{p/\A\f^) < p, 
such that from ([2]) we deduce that 

\Ai\ > 2p(logAr)-2/3 > 2n{\N])p/N, 

as we require. 

From this it follows now from Lemma [1] that 

A(A) > ^3+'o^lr2) > l/A^' » exp(-3c(p/|A.|)^/^), (5) 

for sufficiently large. 

In order for / to minimize A(/), we must have that 

8p"^/^logp = 2(3 > p"^iyo/64 + 0(l/p); 

so, ignoring the 0(l/p), we see that 

K{Ai) < A^p-^Wo < 4V^^^logp. 

It follows from this and that 

I A, I < p(logp)-2/^ 

as claimed. It follows that if we extend S to be the set of all n where Fr- (n) < 
L, then 

Enhin) - S{n)\ < p(logp)-2/3, 
and the second bullet of Theorem [1] is proved upon setting r = Vi. 
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2.2.2 Case 2: ri{n) < 1/2 for all ne¥p\Ai where Fr^n) = L. 
If we are in this case, then it means that 

and so, if we let L' = L — 6, for small enough 6 > 0, then we will have that 
for n E ¥p\ Ai, ri{n) = 1 for Fr-{n) < L', while ri{n) is usually near when 
FrXn) > L'. It follows then that if we let 5* be the set of n G Fp \ where 
FrXn) < L', then 

Eneshin)-1\ < p(logp)-2/3, 

We wish to extend this to where S is the set of all n satisfying Fr-{n) < L', 
by showing that \Ai\ cannot be too big, and we proceed exactly the same way 
as in Case 1 above. We then deduce that, upon redefining S in this way, that 

Enlnin) - S{n)\ < p(logp)-2/3, 
and again this proves the second bullet of Theorem [1] upon setting r = r^. 

2.2.3 Case 3: There exists x,y e ¥p \ Ai where Fr^x) = FrXu) = L, 
and rj(x) < 1/2 < ri{y). 

In order to decide what to do in this case, we will require the following basic 
fact, which is an immediate consequence of the formula for A(/i3) in the proof 
of Proposition [2] in section [231 in equation (fTOj) : We have that if we let 

^ ^ ( ri{n), if n ^ x,y; 

■ \ (ri(x) + ri{y))/2, ii n = x or y, 

then 

Mri+i) < Hri) + P~^iri+i{x) - r,{x))FrXx) + p~\n+i{y) - ri{y))FrXy) + lOp"^ 
< A(ri) + 10p~^ 

So, when we are in this case, we just let 

Ai+i := {x,?/}, 

and so the properties of Aj+i, r^+i that we require all hold. 

2.3 Proof of the third part of Theorem [1] 

We assume for this part of the proof of our theorem that 9 > (log log p)"^/^, 
since our problem is trivial otherwise. 
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We now prove the third bullet of Theorem [H To this end, we let 

where fi is defined as follows: First, we locate the places bi,...,bt where the 
Fourier transform 

where Eq > will be decided later, and then we define the Bohr neighborhood 
B to be all those n E¥p where 

< Eo, for alH = 1, t. 

Finally, we just let = 1/\B\ if n G and /x(n) = otherwise. 
Our goal now will be to show that 

Sj/3H-/(n)| « p(loglogp)-2/^ (6) 

for this will imply the third bullet of Theorem [T] holds: To see this, note that 
from the already-proved first bullet, we know that if we let C{n) be f{n) 
rounded to the nearest integer, then 

^M-\C * B)in) - CHI = ^^\\B\-\f * B){n) - + 0(p(logp)-2/^) 

= E„l/3H-/H| + o(p(iogp)-2/3) 

•C p(loglogp)~^/^, 
which is just what the third bullet claims. 

Now we show that ([6]) holds: First note that Parseval gives 

t < OEf- 

and the following standard lemma tells us that our Bohr neighborhood is 
"large". 

Lemma 2 We have that 

\B\ > (£o + 0(l/p))V 
Proof of the lemma. For i = 1, 2, t, we let 

We note that ai{x) is always a non- negative real for all real numbers x, and 
ai is the Fourier transform of a function jSi :Wp [0, 1]. Furthermore, 

|«i(0)| = Eop + Oil). 
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Now letting 

/3(n) := (A---A)(n), 
we find that : Fp ^ [0, 1], and has support contained within B. So, 

\B\ > m = p-'^\Pi*P2*---*Ptm 

= * ^2 * ■ ■ ■ * at)(0) 

> (£o + 0(l/p))V 



Now, from the easy-to-check fact that 

||/3(a)-/(a)||oo = ||/(a)(l-/i(«))lloo< e^p, 
we easily deduce, via standard arguments (Parseval and Cauchy-Schwarz) that 

A(/3) = P^'Ea/3(a)73(-2a) = p-'T^afiaf f{-2a) + E 

= A(/) + E, 

where the "error" E satisfies 

1^1 < lOeo- 



Now let A be all those n G¥p for which 

Then, we have that 

Wo := j:a,a+d,a+2deAMa)fs{a + d)fsia + 2d) > eyA{A). 
In order to apply Lemma [1] to this, we let 

N = exp{c{p/\A\f^^) < p, 
so that from ([2]) we deduce that 

\A\ > p{\ogNy^^^ > 2rs{N)p/N, 

as we require. 

From this it follows now from Lemma [1] that 

A(^) > ^3 + o^lr2) > » exp(-3(2p/|A|)3/2), 
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for sufficiently large. 

In order for A(/) to be minimal, we must have that 

A(/) < A(/3) < A(/) + 2/3 + 10£o-^?p-'W^o/4 + 0(l/p). 

Setting P = 5eQ we must have 

2Qeo > 6lp-^Wo/2 + 0{l/p) > £?A(A)/2 + 0(l/p); 

and so, 

A(A) < SOeoei' + Oil/p). 
Combining this with our lower bound for A{A) above, we deduce that 

\A\ « pi\ogele^'r'/\ 

It now follows that if C{n) is f3{n) rounded to the nearest integer, then 

Y^nlfsin) -C{n)\ < E„eAl/2 + E„eFp\A^i 
< p{\ogeleoY^/^ + ^iP- 

Now we will set 

Eq := \/ 9 log log p I log p, and e\ := (log log p)"^''^, 
which will give 

\B\ > p^'\ 

and then our sum on |/3(^) — C{n)\ will be at most 

T.n\Hn)-C{n)\ « p(loglogp)-2/3, 
which completes the proof of Theorem [H 

2.4 Proof of Proposition [1] 

2.4.1 Technical lemmas needed for the proof of the Proposition 

We will need to assemble some lemmas to prove this proposition. We begin 
with the following standard fact: 

Lemma 3 Suppose that 5* C Fp satisfies \S\ = ap. Let T denote the comple- 
ment of S. Then, we have that 



A(5)+A(T) = l-3a + 3a 



2 
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Proof of the lemma. One way to prove this is via Fourier analysis: We have 
that 

A(5) + A(T) = p-''Ea{S{ayS{-2a) +f{a)^f{-2a)). 

Since S{a) = —T{a) for a 7^ 0, we have that all the terms except for a = 
vanish. So, 

A(^)+A(r) = p-=^(^(0)=^ + r(0)^) = a^ + {l~a^) = 1 - 3a + 3a\ 



From this lemma, one can deduce the following corollary, which we state 
as another lemma: 

Lemma 4 For a > 2/3 we have that there exists a set S <^ ¥p satisfying 
\S\ = [ap\, and 

MS) < a%l-il-af/2)+0il/p). 

Proof of the Lemma. Let f3 = 1 — a < 1/3, and then let S just be the 
arithmetic progression {0, 1, \_ap\ — 1}, and then let T be the complement 
of S, which is also just an arithmetic progression. It is easy to check that 

A(T) = \T\'/2p' + 0{\T\/p') = PV2 + 0{l/p), 

as the solutions to x + y = 2z, x,y, z G T are exactly those ordered pairs 
{x,z) & T X T of the same parity. 

Applying Lemma [3] to this set T, we find that 

A{S) = (1-3/3 + 3(3^) - [3^/2 + 0{l/p) 
= 1- 3/5 + 5/572 + 0(l/p) 
< (l-/3f(l-/5V2) + 0(l/p), 

as claimed. ■ 

2.4.2 Body of the proof of Proposition [T] 

We will define the function g -.Fp —>■ [0, 1] such that 

support ((7) C AU B, 

where 

for n E B, g{n) = f{n), 
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but on the set A, the funciton g will be different from /: Basically, we let S 
be the set produced by Lemma S] with a = 1 — e, then take T to be a random 
translate and dilate of S, say 

T := m.S + t = {ms + t : s G S*}. 

Then, we let 

for n G A, g{n) = {1 — e)"^ f {n)T (n) . 

Note that this is < 1, because we know f{n) < 1 — e on A. 

We will show that, so long as there are "enough" three-term progressions 
lying in A, this new function g will have the property that A{g) is much smaller 
than A(/). To this end, we consider three types of arithmetic progressions that 
give rise to the counts A(/) and A{g): Those progressions that pass through 
both A and B (say one point in A and two in B; or two in A and one in B); 
those that lie entirely within A; and those that lie entirely within B. 

The contribution to A{g) of those arithmetic progressions lying entirely 
within B is the same as the contribution to A(/). So, we don't need to account 
for these when trying to prove our upper bound on A{g); and therefore there 
are only two non-trivial cases that we need to work out: 

Case 1 (all three points in A). 

Define the random variable 

Zo ■= Yia,a+d,a+2deA9{o)9{a' + d)g{a + 2d), 

and let Wq be the analogous sum but with g replaced by /. We note that if 
we only consider those terms with d ^ 0, we lose at most 0{p) in estimating 

We have that 

E(Zo) = Y^a,a+d,a+2deA¥,igia)gia + d)gia + 2d)) + Oip) 

d^O 

= — £:)^^X]<'."+<*>'»+2rf6A/(a)/(a + (i)/(a + 2(i)X] m,te¥p 1 + 0{p) 

dj^O a,a+d,a + 2dem..S + t 

= p ^(1— £:) ^Yj^'''+<',a+2deAY!ib,b+d'b+2d'eS 

d^O 

E ™.t6Fp f{a)f{a + d)f{a + 2d) + 0{p). 

mb-\-t — a , m(&+d') + t — a + d 

To estimate this inner sum, we note that the contribution of those terms with 
(i' = is 0; and, when d' ^ 0, we get a contribution of f{a)f{a + d)f{a + 2d) 
to just the inner sum, because there is only one pair m, t which works. Thus, 
we deduce from this and Lemma H] that 

E(Zo) = p-\l-6)-^Yt.,+d',,+2d'^sf{a)f{a + d)f{a + 2d) + 0{p) 

a,a + d,a + 2deA 

= il-e)''AiS)Wo + 0{p) 
< {l-e''/2)Wo + 0{p). 
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Case 2 (at least one point in A, and at least one in B). 
Define the random variables 

Zi X]"."+<*GA5f(a)5f(a + (i)5f(a + 2(i) 

a + 2deB 

Z2 := X]'j.<i+2'ieAg((a)(y'(a + (i)5f(a + 2(i) 
Z3 :— Yja+d,a+2deAg(a)g(a + d)g(a + 2d) 

aeB 

Z4 :— y\ aeA q(a)q(a + d)q(a + 2d) 

a+d,a+2deB 

Z5 :— XI g{a)g{a + d)g{a + 2d) 

a,a + 2deB 

Zq :— Yj 'i+'^df^A g{a)g{a + d)g{a + 2d). 

a,a-\-dEB 

Also, let Wi, .... Wq be the analogous constants with g replaced by / (note that 
these are not random variables). 

We will now compute the expectations of these random variables; though, 
we will not do all of these here, and instead will just work it out for Zi, as 
showing it for all the others can be done in exactly the same way, and leads 
to the same bounds. 

We have that 

E(Zi) = Ea+2deB/(a + 2d)E„,„+<igAE(^(a)^(a + d)). 

To evaluate this last expectation, let us suppose that a + 2d & B and a,a + d e 
A, where d Q {ii d — then we would have that a lies both in A and B, 
which is impossible). Then, given any pair of distinct elements x,y & S, there 
exists a unique pair (m, t) E¥p x ¥p such that 

mx + t — a and my + t — b. 

So, the probability that 

gia)gia + d) = (1 - e)-'' f (a) f (a + d) , 

given a + 2d & B, a, a + d E A, is l/p^ times the number of ordered pairs (x, y) 
of distinct elements of 5", which is |>S'|(|5'| — 1). Note that if g{a)g{a + d) is not 
equal to this, then it must take the value 0. It follows that 

E(Zi) = p-^\S\{\S\-l){l-e)-'Wi = Wi + 0{p). (7) 

Likewise for the other Z^, we will have that 

E{Zi) = Wi + 0{p). 
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Collecting the two cases together. 

Let Z-j denote the contribution of arithmetic progressions lying entirely in 
B\ that is, 

Note that in this case W-j = Zj. 

Putting together our above estimates, and using the fact that 

Aig) = p-'(Zo + --- + Zv), 

we find that 

E{A{g)) = p-\Wo + --- + W7-e^Wo/2 + 0{p)) 
= A{f)-e'p-'Wo/2 + 0{l/p). 

Using Markov's inequality we have 

Prob(A(,) < A(/)-.V^l^o/4) > 1 - !^,vVo/4 ^ 

since A(/) > p'^Wq. 

E(g() is close to IE(/) with high probability. 

Before we "derandomize" and pass to an instantiation of g, we will need 
to also show that E,{g) is close to IE(/) with high probability. This can be 
accomplished in several different ways, though here we will just use the second 
moment method: First, let 

F := SaeA/(a), and G Ea€A^(«)- 

Now, as is easy to show, F + 0{l/p) — E(G); and so, since £/3 > p'^^'^logp, 
we have that 

Prob(|F - Gl > 2f3p) < Prob{\G - E{G)\ > (3p). (8) 
It follows from Chebychev's inequality that this last probability is at most 

Var(G') _ E{G^)-E{Gf 

To bound this from above we observe that 

HG') = E„,,eAE(^(a)5(6)). 
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Now, as a consequence of what we worked out just before (jTj), we have that 
g{a) and g{b) are independent whenever a ^h. So, 

E(G2) ^ E(G2)+0(p), 

and it follows that the probability of the right-most event in ((HI) is at most 
0{l3~'^/p). It is easy to see that with probability 1 — 0{(3~'^ /p) we will have 

> E(/)-2/5. (9) 



Conclusion of the proof. 

It follows that with probability at least 

(l-O(rVp)) + e'/S - 1 

we will have that 

E{g) > E{f)-2f3 and A{g) < A{f) - e^p'^Wo/A + 0{l/p). 
Using our assumption that 

ef3 > p'^^"^ logp, 

we have that this probability is positive. So, there exists an instantiation of g 
such that both hold; henceforth, g will no longer be random, but will instead 
be one of these instantiations. 

By reassigning at most 1(5p places a G A where g{a) = to the value 1, we 
can guarantee that Ei^g) > E(/), and one easily sees that 

Aig) < A(/) + 2/3-£V'W^o/4 + 0(l/p). 

This completes the proof of our proposition. ■ 

2.5 Proof of Proposition [2] 

We have that if we define the new function h^ln) = h{n) at all n G Fp, except 
for n = a; and n = y, then 

A{h) = A(/^)+Ei + --- + ^i3, 
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where if we let uj — e^'^'/^, then 



E2 = P''J:ahi(^)\h{x) - = p-\h * h){2x){hs{x) - h{x)) 

E3 = 2p-3S,/i(a)/?.(-2a)(/i3(l/)-/i(|/)K^ 

= 2p-'{h*h2){-y){h{y)-h{y)) 

Ei = 2p-^YiaHo)K-'^o){h3{x) - h{x))uj'''' 

= 2p-\h * /l2)(-x)(/l3(x) - h{x)) 

E, = 2p-^EaM«)(^3(y)-My))'^""' = 2p-2(^3(y)-My))'My) 
^6 = 2p-3Ea^(«)(^3(y) - /i(y))(/i3(^) - h{x))uj<y-^^^ 

= 2p-2(/i3(y) - /i(y))(/i3(^) - h{x))h{2x - y) 
E7 = 2p-'T.am{h3{y) - h{y)){hs{x) - f{x))u<^-^y^ 

= 2p-\hM - h{y)){h{x) - f{x))h{2y - x) 
Es = 2p-^Ea^(«)(^3(^) - h{x)fu;-''^ = 2p-\hs{x) - h{x)fh{x) 
E, = p-'T.aH-'^a){hM - h{y)fuj'''' = p-\h,{y) - h{y)fh{y) 
Eio = p-^E„M-2a)(/i3(^)-M^))V«^ = p-^(/i3(a;)-/i(x))2/i(^) 
En = 2p-3S„M-2a)(/i3(x)-/i(x))(/i3(l/)-/i(l/))a;«(^+^) 

= 2p-2(/.3(a;) - h{x)){hs{y) - h{y))h{{x + y)/2) 
Eu = p-\h{y) - h{y)f 
En = p-\hs{x)-h{x)f. 

There are actually 6 more terms that make up the above "error" ; however, all 
of these give a contribution of 0, which is why they were not listed. 
So, one sees that 

A{hs) = A{h) +p-'{hs{x) - h{x))Fn{x) +p-'{hs{y) - h{y))Fu{y) 

+E5 + --- + En. (10) 



To prove our proposition, all we need to show is that if there is a pair 
x,y E ¥p, x,y ^ A, with 

Fh{x) < Fh{y), and h{y) > 0, 

then in fact 

h{x) = 1. 

Suppose there were such a pair x,y for which h{x) < 1. Then, we will show 
that h fails to minimize A{h) subject to the various constraints: Basically, we 
let 

< £ < mm{l — h{x) , h{y)) 
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(its exact value will be decided later) and then we consider the function 
given by 

for n G Fp, n x,y, we set /i3(n) = h{n); and, 
hsix) = h{x)+e, h^iy) = h{y) - e. 

From our formula ffTOj) . we easily deduce that 

Aih) < Aih)+ep-\F^{x)-Fhiy))-Oie^p-^). 

Clearly, if we take e > small enough, we will get 

Hh) < 

which contradicts the minimality of h. We conclude, therefore, that h{x) = 1, 
as claimed. ■ 
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